THE GENERALIZED WIELANDT INEQUALITY IN INNER 

PRODUCT SPACES 
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Abstract. A new inequality between angles in inner product spaces is for- 
mulated and proved. It leads directly to a concise statement and proof of the 
generalized Wielandt inequality, including a simple description of all cases of 
equality. As a consequence, several recent results in matrix analysis and inner 
product spaces are improved. 



1. Introduction 

The Wielandt and generalized Wielandt inequalities control how much angles 
can change under a given invertible matrix transformation of C". The control is 
given in terms of the condition number of the matrix. Wielandt, in |13j . gave 
a bound on the resulting angles when orthogonal complex lines are transformed. 
Subsequently, Bauer and Householder, in |Tj, extended the inequality to include ar- 
bitrary starting angles. These basic inequalities of matrix analysis were introduced 
to give bounds on convergence rates of iterative projection methods but have found 
a variety of applications in numerical methods, especially eigenvalue estimation. 
They are also applied in multivariate analysis, where angles between vectors corre- 
spond to statistical correlation. See, for example, [I], [5], [6], [7] and [8]. There are 
also matrix- valued versions of the inequality that are receiving attention, especially 
in the context of statistical analysis. See [2], [10], [12], and [IB] . 

The condition number of an invertible matrix A is n(A) — ||.A||||A ||, where || • || 
denotes the operator norm. If A is positive definite and Hermitian, k(A) is easily 
seen to be the ratio of the largest and smallest eigenvalues of A. The following 
statement of the generalized Wielandt inequality is taken from [8] . 

Theorem 1.1. Let A be an invertible nxn matrix. Ifx,y€C n and $, ^> € [0, 7r/2] 
satisfy 

\y*x\ < ||ac||||j/||cos$ and cot(*/2) = k(A) cot($/2), 

then 

\(Ay)*(Ax)\ < ||i4a||||Atf||coB*. 

The generalized Wielandt inequality can be difficult to apply for several reasons. 
First, despite having various equivalent formulations, the inequality seems always 
to be expressed in ways that hide the natural symmetry coming from the invertible 
transformation involved. Next, the conditions for equality are known, see [9], but 
are unwieldy and hard to apply. Finally, the angles involved are angles between 
complex lines rather than between individual vectors. 
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Although the last point seems minor, we found it to be the key to a symmetric 
formulation and a simple description of the cases of equality. In Theorem l2.4l and its 
matrix analytic counterpart, Theorem 13. 1[ we present a new inequality that gives 
sharp upper and lower bounds for the angle between a pair of transformed vectors. 
The conditions for equality are simple and easy to apply. This new inequality 
relates angles between vectors rather than between complex lines but it immediately 
implies a result for angles between complex lines that is equivalent to the generalized 
Wielandt inequality Moreover, this version of the generalized Wielandt inequality 
retains the simple form of the new inequality and (most of) the simplicity of its 
conditions for equality. 

In Section [2] we work in the context of an arbitrary real or complex vector space 
having two inner products. This approach preserves symmetry by avoiding the 
distinction between angles before and after a fixed transformation. Also, the main 
result is not restricted to C™ but holds for vectors in infinite-dimensional spaces. 
As an application of the unrestricted result, we improve a metric space inequality 
from [4] . The main results are then formulated in the language of matrix analysis 
in Section [3j and we apply them to improve inequalities from |15j and [11] . and to 
settle a conjecture from [14] . 

To begin, a short discussion of angles in inner product spaces is in order. In 
a real inner product space (V, (•,•)) the angle 9 — 9(u,v) between two non-zero 
vectors is defined by, < 9 < ir and 



Here ||w|| = y (u, u) is the norm induced by the inner product. The angle between 
subsets S and T of V is the infimum of the angles between non-zero elements of S 
and T, so 

&(S, T) = mi{6{u, v) :0^ue S,0^v eT}. 

With this definition it is easy to check that the angle = 0(Ru,Rti) between the 
lines Ru and Rv satisfies < < ir/2 and 



A complex inner product space (V, (•, ■)) may be viewed as the real inner product 
space (Vr, Re(-, •)) where Vr — V with the scalars restricted to R. Since Re(v, v) = 
(v,v) for all v € V, lengths in V are preserved and therefore so are angles. Thus, 
this real inner product is used to define the angle 9 between the vectors u and v, 
and a computation gives the formula for the angle between the complex lines Cu 
and Cv. We have, 



The second formula is often used as a definition of the angle between vectors u and 
v in a complex inner product space. (Angles defined this way do not determine an- 
gles in triangles correctly but they have the advantage that complex orthogonality, 
namely (u, v) = 0, is equivalent to the angle between u and v being 7r/2.) 
We will make use of the simple observation that if \a\ = 1, then 



cos 9 — 



Re(u, v) 



and cos© = 




(1.1) 0(Ou, Cv) — 9(au, v) if and only if \(u, v) | = a(u, v). 
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(Note that our inner products are taken to be linear in the first variable.) The 
above observation remains valid for 0(Ru, Ru) in a real inner product space, where 
a = ±1. 

2. Main results 

Suppose V is a non-trivial real or complex vector space. Let (•, -)i and (•, -) 2 be 
inner products on V and define to, V m , M, Vm, E\ and E 2 by, 

m = inf ||u|| 2 /||«||i , V m = {v GV : \\v\\ 2 — to||v||i}, 
o=ivev 



(2.1) 



M= sup H|a/|M|i, V M = {v£V:\\v\\ 2 = M\\v\\ 1 }, 

( , , U V U V 

E = E j = \ (u, v) : — ip + — — G V m , j—r- - j—r- £ V M 



for j = 1,2. Here, as usual, ||w||i = (v, vj[ and ||w|| 2 = \J (v, v) 2 . We anticipate 
the result of Corollary 12.31 in the definition of E above. 

Evidently < m < M < oo, £ V m and G Vm- (The convention • oo = 
ensures that G Vm when M = oo.) A standard compactness argument shows that 
if V is finite dimensional then < m < M < oo and V m ^ {0} ^ Vm- If to = M 
then V m = Vm = V and, by polarization, (u, v) 2 = m 2 (u, v)i for all u, v G V. 

Lemma 2.1. Let V be a real vector space equipped with inner products (•, -)i and 
(■,-) 2 . Make the definitions i2. 1)) . If to < M, then V m and Vm are subspaces and 
the two are mutually orthogonal with respect to both inner products. 

Proof. Suppose it is a non-zero vector in V m and v G V is not a multiple of u. 
Then 

f(f) = \\u + tv\\l = (u 1 u) 2 + 2t(u,v} 2 +t 2 (v,v) 2 
U \\u + tv\\\ (u,u) 1 +2t(u,v) 1 +t 2 (v,v) 1 
is defined and diffcrcntiable for t g 1. Since / achieves its minimum value at t = 0, 
/'(0) = 0. That is, (u,v) 2 (u,u)i = (u,u) 2 (u,v)i- Thus, for all u G V m and all 

veV, 

(u,v) 2 = m 2 (u,v)i. 

(The excluded case, u = or v a multiple of u, is easily verified.) It follows that if 
v G V m then / is the constant function with value to 2 . In particular, f(l) — to 2 , so 
u + v G V m • Since it is clearly closed under scalar multiplication, V m is a subspace. 

Repeating the argument for Vm shows that it, too, is a subspace and that for all 
v G Vm and u G V, 

(u,u}a = M 2 {u, v)\. 

If u G K„ and w G Va/ then m 2 (u,v)i = (u,v) 2 = M 2 (u,v)i and hence (u,v)i = 
(u,v) 2 = 0. Thus it and v are orthogonal with respect to both inner products. This 
completes the proof. □ 

Corollary 2.2. Let V be a real vector space equipped with inner products (•, -)i and 
(•, -) 2 . Make the definitions 12.1]) . IfV is two-dimensional, then there is a basis of 
V that is orthogonal with respect to both inner products. 

Proof. If m = M then the two inner products are multiples of each other and 
any orthogonal basis will do. Otherwise, let ^ b G V m and 7^ B G Vm- Then 
{b, B} is the desired basis. □ 
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The next result justifies the use of E to denote either E\ or E 2 . 

Corollary 2.3. Let V be a real vector space equipped with inner products (•, -)i and 
(■, -)2. Make the definitions h2.1\) . Then E\ = E 2 . 

Proof. By symmetry it is enough to show that E\ C £ 2 . For (u, v) € E\, let 

u v u v 

e V m and W = j—r, TT^T G Vm • 



\Hi \\v\\x \\u\U \\v\U 

By Lemma |2~T1 w and W are orthogonal with respect to (•, -)2, so 

IMI3/NI! = + w\\l = KIMI2 + \\w\\ 2 2 ) = \\\w-w\\l = \\v\\l/\\ 

Thus 

- -w G V m and rr-r, —— = t—t-W € V m 



li • 



|M|2 ||«||2 \\u\\2 ||u||2 \\v\\2 \\u\\ 2 

and so (u,v) € E 2 . □ 

Having two inner products, the space V has two differing notions of the angle 
between vectors. Our main result provides a comparison between these angles in 
terms of the quantities m and M defined in ([2.1J . 



Theorem 2.4. Let V be a real or complex vector space equipped with inner products 
(•, -)i and (•, -)2- Make the definitions h2.1\) . For independent vectors u and vinV 
let <p and ip be defined by, < (p < n, < ip < ir, 

Re(u,v)i , , Re(u,v)2 
cosip = and cos?/; = . 

IMIlHIl ll u l|2||w||2 

Then 

(2.2) (m/M) tan(yj/2) < tau(t/>/2) < (M/m) tan(^/2). 

Equality holds in the right-hand inequality if and only if (u,v) £ E. Equality holds 
in the left-hand inequality if and only if (u, —v) G E. 

Proof. First consider the case that V is a real vector space. Note that the 
assumption of independence ensures < ip < tt and < ip < it. 

By Corollary 12. 2[ the span of u and v has a basis {b, B} that is orthogonal 
with respect to both inner products. Without loss of generality we may assume 
that H&lli = ||-B||i = 1. For notational convenience, set n = H&H2 and N — \\B\\2 
and suppose, by interchanging b and B if necessary, that n < N. Note that the 
definitions of m and M ensure that m < n and N < M. Write u = Ubb + ubB 
and v = vt>b + vbB for some real numbers Ub, ub, Vb> and vg- In terms of these 
coordinates we have, 

\\u\\t\\v\\lsm\ = \\u\\l\\v\\i-{u,v)t 

= (ul + u B ){vl + v%) - (u b v b + u B v B ) 2 

= (u b v B - u B v b ) 2 

and 

Nl5lHllBin 2 v = ||u||liHil -<«,«)! 

= {n 2 u 2 b + N 2 u 2 B ){n 2 v 2 b + N 2 v 2 B ) - (n 2 u b v b + N 2 u B v B ) 2 
= n 2 N 2 (u b v B - u B v b ) 2 . 
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Thus, 

(2.3) ||u||2|H|2SinV> = niV||u||i|H|i sin<£. 

The derivative of 

g(x) = (u 2 + xug) 1 / 2 ^ + xv B ) x l 2 + (ubvt + xubvb) 



is 



1 



V4 / 2 , _,2 \ 1/4 



2 



2 V W+™|/ W+^I/ J 

so ,g(l) < g(N 2 /n 2 ). Multiplying both sides of this by n 2 gives, 

(2.4) n^luHilMliCl + cosp) < ||u||a|H|a(l + cos^). 

Combining (|2.3|l and (|2.4[) gives, 

. . . , simp nN sirup . , 

2.5 tan V/2 = ^— r < ^— = (N n) tan(<p/2), 

v ; v ' 7 (1 + cos?/;) ~ n 2 (l + cos^) v ' y v ' 7 

with equality if and only if g'(x) = for x G (1, N 2 /n 2 ). Since m < n < N < M, 
(I2.5[) proves the right-hand inequality of (|2.2[) . 

If equality holds in the right-hand inequality of (|2.2[) , then equality holds in (|2.5[) 
and n — m, N — M, b € K„, and £? € Vjvf. If to = M then V^j = Vm — V and 
ip = ip so the last two statements of the theorem are trivial. Otherwise, equality in 
(|2.5p implies that g' is zero on the non-trivial interval (1, M 2 /m 2 ). That is, 

V4 /„,2 , „„.2 \ V4 



and hence u^v 2 = w^it^. Since u and i» are independent, both ub and «b are 
non-zero, they have opposite signs, and usVb = —v B Ub- Therefore, 

v Ubb + usB Vbb + vsB 




b T U B 



\\ V \U yju 2 + u 2 B y/% 

(ub/u B )b + B (vb/v B )b + B \ _ 2(u b /u B )b 

^ t . . . ^ — *m 



and 



y/(u b /u B ) 2 + 1 y/(v b /v B ) 2 + 1 J y/(u b /u B ) 2 

v Ubb + u B B Vbb + v B B 



Mil H|l y/ U 2 + U% y/vf 



b ^ U B 

= ± / (m,/u B )6 + .B + (Vi; B )b + g \ = ± 2B ^ y 

VVTV^F+T y(v^F+T/ vTW^F+T A/ ' 

That is, (w, u) € #i = E. 

Conversely, suppose that (u, v) € J5, set 

U i> u v 

w = Ti u + 7f-j7- e Vm and VV = — — — — G Km, 

hlli Nil ||w||i IM|i 

and observe that w + W is in the direction of u and u? — W is in the direction of 
v. By Lemma [2.11 w and W are orthogonal with respect to both inner products. 
Thus, 

(w + W,w- W)i _ \\w\\l - \\W\\\ 



COS ip : 



Ww + wuw-wh IIHIf + ini? 
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and 

2 1-cosy 

tan (p 2) = - = - — F . 

l + cosy> 

A similar calculation yields the corresponding formula for ip and leads to the con- 
clusion, 



\Wg M 2 \\W\\l 
Nil m 2 ||u;|| 2 



taxf(ip/2) = ^— = — ' 1/ = (M/m) 2 tan 2 (^/2) 



Taking square roots establishes equality in the right-hand inequality of 

Applying the right-hand inequality of (|2.2I) to the vectors u and — v replaces ip 
by 7r — if and ip by 7r — ip to give the conclusion, 

cot(^/2) = tan(vr/2 - ^/2) < (M/m) tan(?r/2 - p/2) = (M/m) co%/2). 

This proves the left-hand inequality of (|2.2j) . with equality if and only if (u, —v) e E 1 . 
This completes the proof in the case that V is a real vector space. 

If V is a complex space and and (-,-)2 are complex inner products, the 

conclusion of the theorem follows by applying the result just proved to the real 
vector space Vk equipped with the real inner products Re(-, -)i and Re(-, ■)%. This 
completes the proof. □ 

The angle between two subsets of V is defined as an infimum of angles between 
pairs of vectors. The inequality (12.21) remains valid when we take an infimum of all 
three terms so we have the following result. Note that since the cosine function is 
decreasing, the cosine of an infimum of angles is achieved by taking the supremum 
of their cosines. 

Corollary 2.5. Let V be a real or complex vector space equipped with inner products 
(•, -)i and (•, -)2. Make the definitions \2. 1)) . For S, T C V, each containing at least 
one non-zero vector, let $ and \& be the angles between the subsets S and T with 
respect to (•, -)i and (•, ■)%, respectively. That is, < $ < ir, < ^ < tt, 

. . _ Re(u, v)i , T Re(u,v) 2 
(2.6) cos$= sup , and cos* = sup . 

0=iu€S IMMMIl Q^ueS ll«l|2||w||2 



The 



(™/M)tan($/2) < tan(*/2) < (M/m) tan($/2). 



The following theorem is our version of the generalized Wielandt inequality in 
inner product spaces. As pointed out earlier, the angles between the (real or com- 
plex) lines determined by u and v are often taken as alternative definitions of the 
angle between vectors themselves. We show that with this definition the results 
of Theorem 12.41 still hold, but the conditions for equality become slightly more 
complicated. 

Theorem 2.6. Let V be a real or complex vector space equipped with inner products 
(•, -)i and (•, -)2- Make the definitions For independent vectors u and v in V 

let $ and * be defined by, < $ < n/2, < * < n/2, 

\(u, v)\ I I (u, 11)2! 

cos$ = , V ' / 11 and cos*- IV ' 71 



1MI1IMI1 IMI2IMI2' 

Then 

(2.7) (m/M) tan($/2) < tan(*/2) < (M/m) tan($/2). 
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Let at\ and a.2 be solutions to \(u, v)i\ — ai(u, u)i and \(u, v)?] — ct2{u, v)2- Equality 
holds in the right-hand inequality of \2. 7[ ) if and only if (aiu, v) G E and either 
ol\ = U2 or (u,v)2 = 0. Equality holds in the left-hand inequality of if and 

only if (a^u, —v) € E and either a± = cx2 or (u,v)i = 0. 

Proof. Apply Corollary ED to the lines S = Cu and T = Cv {S = Ru and 
T = Rv in the real case) to obtain (|2.7j) . By $ is the angle between a\u and 

v with respect to (•, -)i and $ is the angle between a.2U and v with respect to (•, -)2- 
To analyse the right-hand inequality of (|2.7[) , let 9 be the angle between a,yu and v 
with respect to (•, -)2- The infimum definition of "J and Theorem I2.4I show that 

(2.8) tan(*/2) < tan(0/2) < (M/m) tan($/2). 

By (jl.ll) . the first of these is equality if and only if either ot\ — a.2 or (it, 11)2 = 0. 
By Theorem 12.41 the second is equality if and only if {a\u, v) G E. Thus equality 
holds in the right-hand inequality of ()2.7[) if and only if (aiu, v) <G E and either 
ax = a.2 or (u, v)2 = 0. 

To analyse the left-hand inequality of (|2.7|) . let 8 be the angle between a%u and 
v with respect to (•, The infimum definition of 5> and Theorem l2.4l show that 

(2.9) (m/M) tan($/2) < (m/M) tan(0/2) < tan(*/2). 

By (jl.ll) . the first of these is equality if and only if either a± — 012 or (u,v)\ = 0. 
By Theorem 12.41 the second is equality if and only if (at2U, —v) G E. Thus equality 
holds in the left-hand inequality of (|2.7p if and only if (a^u, —v) G E and either 
Q'l = a.2 or (u, v)i = 0. □ 

The inequalities (|2.2[) and (|2.7p can be expressed in various equivalent forms. In 
terms of cosines (|2.2p becomes, with \ = [M 2 — m 2 )/(M 2 + m 2 ), 

(2.10) < cos^ < 



1 — x cos <P 1 + X cos f 

Replace ip and V> by $ and to get the expression for (|2.7p . In terms of inner prod- 
ucts instead of angles, the inequalities (|2.2p of Theorem 12.41 and (|2 . T[) of Theorem 
2.61 become, in the case ||u||i = |jw||i = 1, 



^ ^ -x + Re(u,^)i < Re(M,v) 2 < X + Re(u, u)i 



l-xRe(ti,u)i ~ IMI2IMI2 ~~ l + xRe(M,u)i 
and 

/ 212 n -x+Km,^)!^ |(m,«) 2 | < X+|(u,«)i| 



1-Xl(w,w>i| |M| 2 |M| 2 
respectively. 

The special case $ = n/2 in Theorem 12.61 gives an inner product formulation of 
Wielandt's inequality that includes all cases of equality. Note that the right-hand 
inequality of (|2.12l) is equivalent to the left-hand inequality of (12. 7p . 



Corollary 2.7. Let V be a real or complex vector space equipped with inner products 
(•, -)i and (■, -)2- Make the definitions \2.1)) . Suppose the non-zero vectors u, v G V 
are orthogonal with respect to and a satisfies |(u,w) 2 | = a{u,v)2- Then, 

, 2n > \{uM M 2 ~m 2 

1 ' ' IHHMI2 ~ M 2 + m 2 

with equality if and only if (au, —v) G E. 
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The following theorem gives upper and lower bounds on the difference between 
the cosines of p and ip. It improves the estimates given in Theorems 1 and 2 of [4]. 

Theorem 2.8. Let V be a real or complex vector space equipped with inner products 
(•, ')x and (•, Make the definitions 12.1]) . For independent vectors u and v in V , 

2 M -to < Rc(u,v) 2 Re(u,v)i <2 M -m 

M + m - IMI2IMI2 ||w||i||«||i ~~ M + m 

and, if Re(u,v)i > 0, then 

Re(u,v) 2 Re(u,-u)i M 2 — m 2 

■ ' HHMI2 IM|i|M|i ~ M 2 + m 2- 

Also, 

(0 Ifil M 2 -m 2 |( M ,u) 2 | Kiyu)ij M 2 — m 2 

1 ' ' M 2 + m 2 ~ \\u\\ 2 \\vh \H\1\H1 ~ M 2 +m 2 ' 

Proof. Suppose p> and ip are the angles between u and v with respect to (•, -)i 
and (•, -) 2 . Since, 

cos?/' - cos<yj = 2/(1 + tan 2 (V>/2)) - 2/(1 + tan 2 (^/2)), 



Theorem 12.41 gives 
2 



< cos t/> — COS ip < 



l + {M/m) 2 x l + x~ l + (m/M) 2 x 1 + x' 

where x = tan 2 (ip/2). A little calculus shows that the minimum value, over all 
x e [0,oo], of the expression on the left occurs at x = m/M and the maximum 
value, over all x € [0, 00], of the expression on the right occurs at x = M/m. This 
gives (|2T4) . If Re{u,v)i > then ip < vr/2 and so x = teai 2 {p>/2) < 1. The 
maximum value on the right now occurs at x = 1, giving (|2.15l) . 

The same analysis, applied to the angles $ and ^ between the lines Cu and 
Cv (or Ru and Rv in the real case) includes the restriction tan 2 ($/2) < 1 and 
gives the right-hand inequality in (|2.16|) . The left-hand inequality follows from 
the right-hand one by interchanging the inner products and (-,-) 2 . Besides 

interchanging the angles cp and ip, this has the effect of replacing m by 1/M and 
M by 1 /to to give 



IMil |(«,«)a| (1/m) 2 - (1/M) 2 M 



r2 



NI1NI1 NI2H2 " (1/m) 2 + (1/M) 2 M 2 + to 2 ' 
Multiplying through by — 1 completes the proof. □ 

In our notation, Dragomir's results from [4] are 

M 2 |( M ,p) 2 | Kt^t^jJ m 2 

m 2 " ||u|| 2 ||v|| a ||«||i||«||i " M 2 ' 
and, if Re(u, v)x > 0, then 

M 2 Re(u,u)2 Re(u,w)i to 2 

1 ~ m*~ - IRRRb ~ \Hi\Hi ~ 1 ~ m*' 



Since 



^ A/ 2 2 M - m < M 2 - to 2 M 2 - m 2 < ^ to 2 

to 2 ~ M + m ~ M 2 + to 2 M 2 + to 2 ~ M 2 ' 
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Theorem 12.81 improves on both of these statements. 

The estimate (|2.14|) . on the difference between the cosines of <p and ip readily 
gives a lower bound on the product of those cosines. 

Corollary 2.9. Let V be a real or complex vector space equipped with inner products 
(•, -)i and (•, -)2- Make the definitions h2.1\) . For independent vectors u and v in V , 

(217} Re(M,t;)i Re(u,v) 2 > f M - to n ' 

||w||i||u||i ||u||2||u||2 ~~ \M + m 
Proof. Let // = (M - to) / (M + to), 

Re(tt, v)i , Ke(u.v) 
and 



IMIilMli' y \\uh\\v\W 

Note that < fi < 1. By the Cauchy-Schwarz inequality and (I2.14j) . the point 
(x, y) lies in the region defined by— 1 < x < 1, —1 < y < 1, and — 2/i < x — y < 2/i. 
Minimizing xy over this hexagonal region easily yields (x, y) = (—fx, /i) or [x, y) = 
(/i, — //). Thus, xy > —fi 2 as required. □ 

3. Formulation in terms of matrices 
The angle 9 between vectors x, y £ C™ is defined by < 9 < tt and 

Rey*x 
cos9= 

\m\\\y\\ 

and the angle 9 between the complex lines Ca: and Cy satisfies < 8 < ir/2 and 

\m\\\y\\ 

Let A be an invertible n x n matrix and consider the two inner products 

(3.1) (x,y) 1 =y*x and {x,y) 2 = {Ay)*(Ax) 

on C™. Then the definitions in (f2Tj) show that M = \\A\\ and 1/m = so the 

condition number of A is k(A) = M/m. Theorem 12 .41 becomes the following. 

Theorem 3.1. Let A be an invertible n x n matrix. For independent x, y £ C™ let 
ip be the angle between x and y and let ip be the angle between Ax and Ay. Then, 

^(Ay 1 tan(</>/2) < tan(^/2) < k(A) tan(<^/2). 

Let X n and Ai denote the smallest and largest eigenvalues of A* A. Then equality 
holds in the right-hand inequality above if and only if x/\\x\\ + y/\\y\\ is in the 
X n -eigenspace of A* A and x/\\x\\ — y/\\y\\ is in the \\-eigenspace of A* A. Also, 
equality holds in the left-hand inequality above if and only if x/\\x\\ — y/||y|| is in 
the X n -eigenspace of A* A and x/\\x\\ +y/\\y\\ is in the \\-eigenspace of A* A. 

Theorem 1 2 . 6 1 gives a concise reformulation of the generalized Wielandt inequality. 
Since n(A) — k{A^ v ), the symmetry between the angles <!> and ^ is clear. 

Theorem 3.2. Let A be an invertible n x n matrix. For independent x,y £ C n let 
$ be the angle between the complex lines Cx and Cy and let ^ be the angle between 
the complex lines C(Ax) and C(Ay). Then 

K(A) _1 tan($/2) < tan(*/2) < k(A) tan($/2). 
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It takes a bit of care to show the equivalence of this theorem with Theorem 
11.11 because the angles $ and "J represent subtly different concepts in the two 
statements. In Theorem 13.21 <& and \t represent angles between given complex 
lines, while in Theorem 11.11 thev represent bounds on those angles rather than the 
angles themselves. Also, one must apply Theorem 1 1.1 1 to A and to A^ 1 (or else to 
x, y and to x, —y) to obtain both sides of the inequality above. 

The conclusion of Theorems 13.11 and 13.21 may be rewritten as 



/o 0A -X + cosw x + cos^ 

(3.2) <cos^< — , 

1 — x cos <P 1 + X cos 

where \ — ( K {A) 2 — 1)/(k(A) 2 + 1). (Of course, ip and ip should be replaced by <& 



and ^ when rewriting Theorem 13.21 ) 

We have omitted the characterization of the cases of equality in Theorem 
but they can be readily obtained from Theorem 12.61 Conditions for equality in 
Theorem 12.41 are simpler than those in Theorem 12.61 because the former deals with 
angles between a single pair of vectors and the latter with an infimum of angles 
between vectors in two one-dimensional subspaces. To recognize when equality 
occurs in Theorem 12.41 one only has to consider the placement of the vectors u and 
v relative to the eigenspaces V m and Vm- But equality in Theorem [51)] requires that 
this infimum of angles be achieved for u and v in addition to requiring their correct 
placement with respect to these eigenspaces. In [9], Kolotilina gave the following 
characterization of the cases of equality in the generalized Wielandt inequality, 
without explicit recognition of this two-stage requirement. We give an alternative 
proof using Theorem 12.61 (Notice that the complex numbers £ and rj appearing in 
the Theorem of [9] are unnecessary as they may be absorbed into the eigenvectors 
x\ and x n .) 

Proposition 3.3. Let B be an n x n invertible Hermitian matrix, suppose \\ > 
A„ > are its largest and smallest eigenvalues, respectively, and set \ — (Ai — 
A n )/(Ai+A„). Fix independent x , y € C n and let cos ip = \y*x\/(\\x\\\\y\\). Then 

(3.3) \y*Bx\ = X + C0SL P ^FBi^ftBy' 

1 + x cos ip 

if and only if 
(3.4) 



- — 77 = —=(\/l + cost^Xi + Jl - cos tp x n ) , and 
\x\\ V2 

y 



| ; - -^=(\/l + costpxi - yjl - cos</32;„) 

for some complex number e of unit modulus and some unit eigenvectors X\ and x n 
satisfying Bxi — X\Xi and Bx n = X n x n . 

Proof. With A = B 1 / 2 we have B = A* A. Apply Theorem 12.61 to the inner 
products (|3.ip and note that M = Ai and m — X n so Vm and V m are the Ai- and 
A n -eigenspaces of B, respectively. Using (|2.10p . we see that (|3.3I) is equivalent to 
equality in the left hand inequality of (|2.7p . Thus, Theorem 12.61 shows that (|3.3p 
holds if and only if (uix, —y) € E and either ct\ = ct2 or y*x — 0. As in Theorem 
EH \y*x\ = aiy*x and \(Ay)*(Ax)\ = a 2 (Ay)*(Ax). 

First suppose that x and y satisfy (|3.4p . A calculation, using the fact that x\ 
and x n are orthogonal, shows that ey*x > and e{Ay)* (Ax) > 0. It follows that 
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either ai = a 2 = e or y*x — 0. Also, 



ex —y 



\\-y\ 

and 

ex —y 



ii^ii II -v\ 



= V2e y/T— cos tp x n 6 V m 
= \/2ey/l + cosipxi € Vm 



so (a 2 x, —y) e £7. 

Conversely, suppose that (a 2 x, —y) € -E and either ai = a 2 or y*a; = 0. Set 
e = a 2 . Then there exist w € Vm, and W € Vm such that 

ex y ex y 

IfII II2/II IMI Hj/II 

Since u> and IV are orthogonal, the parallelogram law gives ||IV|| 2 + ||u>|| 2 = 4 and 
the definition of (p gives ||W|| 2 — ||^|| 2 = 4 cos ip. Solving these two equations 
yields, \\W\\ = \/2y/l + cos(p and ||w|| = \/2y/l - cos 93. With £1 = eW/||W|| and 
x n = ew/\\w\\ we have (|3.4p . This completes the proof. □ 
In Theorem 3 of [15], Yeh gave a different generalization of the Wielandt in- 
equality for angles between complex lines. Here we show that Theorem 13.21 gives 
the stronger inequality. 

Theorem 3.4. [15j Let A be an invertible nx n matrix. For independent 1,1/6 C™ 
let $ be the angle between the complex lines Cx and <Cy and let 'J be the an- 
gle between the complex lines <C(Ax) and <C(Ay). Define 9by0<9<ir/2 and 
cot(0/2) = k{A). 7/cos$ < 1/k(A) 2 , then 

(3.5) cos* < cos6> + 2cos 2 (6»/2)cos$. 

Proof. By Theorem 13.21 and (|3.2[) . it is enough to show that 

y -|- COS $ 

— < cos 9 + (1 + cos 9) cos $, 

1 + x cos $ ~ 

where 

_ n{A) 2 - 1 _ cot 2 (fl/2) - 1 _ 
X ~ n{Af + l ~ cot 2 (0/2) 1 1 _C ° ' 
But both x an d cos 4> are positive, so 
X + cos $ 



1 + x cos <E> 



< x + cos $ < x + (1 + x) cos $ 



as required. □ 
In Theorem 3.1 of [M], Yan generalized the Wielandt inequality for real sym- 
metric matrices as follows. 

Theorem 3.5. |14j Let B be a real n x n symmetric positive definite matrix with 
eigenvalues Xi > X 2 ■ ■ ■ > X n > 0. For independent x, y € M" define <E> by < $ < 
7r/2 and ||x||||j/|| cos$ = |?/ t :e|. Then, 

, , , t , / A;COS 2 ($/2) - A 1 sin 2 ($/2)\ /-= /-= 

(3.6) kc T -By < max- vr^A " 0, , t x T Bx\J y T By. 

V ' 1 \ 2, j A, cos 2 $/2 + Aj sin 2 $/2 7 Vy tf 
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It was left as a conjecture in [14] that the theorem remains true for complex 
vectors x and y and a positive definite Hcrmitian matrix B. 
It is routine to verify that the expression 

s cos ($/2) -isin 2 ($/2) 
scos 2 ($/2) +tsin 2 ($/2) 

is increasing in s and decreasing in t. Thus, the maximum in (|3.6I) is achieved when 
i = 1 and j = n, where it takes the value, 

AiCOS 2 ($/2) - A„sin 2 (<I>/2) _ x + cos$ 

Ai cos 2 ($/2) + A n sin 2 ($/2) ~~ 1 + x cos $ ' 

Here % = Oi/A„ - 1)/(A X /A„ + 1). If A = B 1 / 2 , then n{A) 2 = k{B) = Ai/A„ so 
Theorem l3.2l and (|3 . 2[) implies that Theorem 13 . 5 1 holds in both the real and complex 
cases, confirming Yan's conjecture. 

We end with an improvement of Lemma 2.2 from |llj . It follows directly from 
Corollary 12.91 with (x,y}i = y T Ax and (x,y)2 — y T Bx. 

Lemma 3.6. Suppose A and B are real symmetric positive definite n x n matrices 
and let k = k{A~ x I 2 BA~ x I 2 ). Then for x, y G E" with y ^ 0, 

y T Ax y T Bx > _ f y/R- 1 \ 2 

Vx T Ax^y T AyVx T Bx^y T By ~ xV^+lJ 

The above inequality followed by the AM-GM inequality give the conclusion of 
Lemma 2.2 from |11) : 

^y T Axy T Bx > ( ^fn - 1 \ 2 (x T Ax x T Bx\ 1/2 
y T Ay y T By ~ \Jk+ l) \y T Ay y T By) 

( y/a- 1 \ 2 ( x T Ax x T Bx \ 
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